# RLHF alignment
Llama 3.1 405B Instruct
Llama 3.1 is a series of multilingual large language models developed by Meta, available in 8B, 70B, and 405B parameter sizes, supporting multilingual text generation and code generation.
Large Language Model
Transformers, Supports Multiple Languages

meta-llama
34.83k
569
Llama 3.1 405B FP8
Meta Llama 3.1 is a collection of multilingual large language models, comprising pre-trained and instruction-tuned generative models at 8B, 70B, and 405B parameters. It supports 8 languages and performs strongly on industry benchmarks.
Large Language Model
Transformers, Supports Multiple Languages

meta-llama
540
115
Gpt2 Large Harmless Reward Model
MIT
A GPT-2 Large model trained on the harmless subset of the Anthropic/hh-rlhf dataset, intended for harmful response detection or as a reward model in reinforcement learning from human feedback (RLHF).
Large Language Model
Transformers

Ray2333
1,489
3